Add boundary checks and add tests for SparseTensorProtoToDenseTensorProto by yuslepukhin · Pull Request #27323 · microsoft/onnxruntime

yuslepukhin · 2026-02-12T03:57:59Z

Motivation and Context

Description

This pull request improves the robustness and correctness of sparse tensor conversion to dense tensors in the ONNX Runtime codebase. The main changes include enhanced validation of tensor shapes and indices, improved support for different index data types, and better error handling for out-of-bounds indices and invalid shapes.

Validation and Error Handling Improvements:

Added explicit checks to ensure that sparse tensor indices are either rank 1 or 2, and that values are 1D, returning clear errors for unsupported shapes. Also, added checks to ensure all dimension sizes are positive for both values and dense tensors.
Improved error messages and validation for out-of-bounds indices during the sparse-to-dense copy process, including both flattened and multi-dimensional COO indices. [1] [2]

Index Data Type Handling:

Enhanced support for INT8 and INT16 index types by allowing them to be provided in the int32_data field (not just as raw data), and improved handling of index data extraction for all supported types.

API and Internal Consistency:

Refactored the CopySparseData function to take both dense_dims and dense_elements as arguments for clearer logic and consistency. [1] [2] [3] [4]
Updated function documentation to clarify supported formats and expected tensor shapes.

Minor Corrections:

Improved error messages for invalid index shapes and ensured correct usage of ONNX enum values and utility functions. [1] [2]

These changes collectively make sparse tensor handling more robust, user-friendly, and standards-compliant.

Motivation and Context

Make the code more robust

Copilot

Pull request overview

This PR aims to harden SparseTensorProtoToDenseTensorProto in ONNX Runtime by adding validation around sparse indices/shape handling and extending index dtype support, along with new unit tests to cover the behavior.

Changes:

Refactors CopySparseData to take dense_dims and dense_elements, and adds bounds checks for rank-1 (flattened) and rank-2 (COO) indices.
Adds input validation for indices rank, values rank, and (new) dimension constraints.
Adds new unit tests covering multiple index dtypes, raw vs typed index storage, and several invalid-input scenarios.

Reviewed changes

Copilot reviewed 3 out of 3 changed files in this pull request and generated 6 comments.

File	Description
`onnxruntime/test/framework/tensorutils_test.cc`	Adds coverage for sparse-to-dense conversion across index dtypes and invalid shapes/out-of-bounds cases.
`onnxruntime/core/framework/tensorprotoutils.h`	Updates/extends the API documentation for sparse-to-dense conversion.
`onnxruntime/core/framework/tensorprotoutils.cc`	Implements the refactor and new validation/bounds checking in the sparse-to-dense conversion path.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

onnxruntime/core/framework/tensorprotoutils.cc

onnxruntime/core/framework/tensorprotoutils.h

onnxruntime/core/framework/tensorprotoutils.cc

skottmckay

…roto (#27323) ### Motivation and Context ### Description This pull request improves the robustness and correctness of sparse tensor conversion to dense tensors in the ONNX Runtime codebase. The main changes include enhanced validation of tensor shapes and indices, improved support for different index data types, and better error handling for out-of-bounds indices and invalid shapes. **Validation and Error Handling Improvements:** - Added explicit checks to ensure that sparse tensor indices are either rank 1 or 2, and that values are 1D, returning clear errors for unsupported shapes. Also, added checks to ensure all dimension sizes are positive for both values and dense tensors. - Improved error messages and validation for out-of-bounds indices during the sparse-to-dense copy process, including both flattened and multi-dimensional COO indices. [[1]](diffhunk://#diff-d31e9fbe0f5334fcd949833e035f2b25d5ae810dcd505c545f6b372b546b1406L1827-R1850) [[2]](diffhunk://#diff-d31e9fbe0f5334fcd949833e035f2b25d5ae810dcd505c545f6b372b546b1406R1861-R1863) **Index Data Type Handling:** - Enhanced support for INT8 and INT16 index types by allowing them to be provided in the `int32_data` field (not just as raw data), and improved handling of index data extraction for all supported types. **API and Internal Consistency:** - Refactored the `CopySparseData` function to take both `dense_dims` and `dense_elements` as arguments for clearer logic and consistency. [[1]](diffhunk://#diff-d31e9fbe0f5334fcd949833e035f2b25d5ae810dcd505c545f6b372b546b1406L1747-R1781) [[2]](diffhunk://#diff-d31e9fbe0f5334fcd949833e035f2b25d5ae810dcd505c545f6b372b546b1406L1904-R1947) [[3]](diffhunk://#diff-d31e9fbe0f5334fcd949833e035f2b25d5ae810dcd505c545f6b372b546b1406L1928-R1957) [[4]](diffhunk://#diff-d31e9fbe0f5334fcd949833e035f2b25d5ae810dcd505c545f6b372b546b1406L1938-R1967) - Updated function documentation to clarify supported formats and expected tensor shapes. **Minor Corrections:** - Improved error messages for invalid index shapes and ensured correct usage of ONNX enum values and utility functions. [[1]](diffhunk://#diff-d31e9fbe0f5334fcd949833e035f2b25d5ae810dcd505c545f6b372b546b1406L1863-R1872) [[2]](diffhunk://#diff-d31e9fbe0f5334fcd949833e035f2b25d5ae810dcd505c545f6b372b546b1406L1747-R1781) These changes collectively make sparse tensor handling more robust, user-friendly, and standards-compliant. ### Motivation and Context Make the code more robust

This cherry-picks the following commits for the release: - #27323 - #27339 --------- Co-authored-by: Dmitri Smirnov <yuslepukhin@users.noreply.github.com> Co-authored-by: Edward Chen <18449977+edgchen1@users.noreply.github.com>

yuslepukhin added 2 commits February 11, 2026 19:30

Fix issues and add tests

883d509

Add more tests

2944894

yuslepukhin requested review from Copilot and skottmckay February 12, 2026 03:57

Copilot started reviewing on behalf of yuslepukhin February 12, 2026 03:58 View session

Copilot AI reviewed Feb 12, 2026

View reviewed changes

skottmckay reviewed Feb 12, 2026

View reviewed changes

onnxruntime/core/framework/tensorprotoutils.cc Outdated Show resolved Hide resolved

onnxruntime/core/framework/tensorprotoutils.cc Outdated Show resolved Hide resolved

yuslepukhin added 4 commits February 12, 2026 10:57

Address signed/unsiged comparision

a644643

Move tests

efb1454

Fix all zeros

361f22f

Add more tests

a44ac07

yuslepukhin marked this pull request as ready for review February 13, 2026 03:03

yuslepukhin closed this Feb 13, 2026

yuslepukhin reopened this Feb 13, 2026

skottmckay approved these changes Feb 13, 2026

View reviewed changes

yuslepukhin merged commit 1ade88a into main Feb 13, 2026
95 of 166 checks passed

yuslepukhin deleted the yuslepukhin/sparse_proto_conversion branch February 13, 2026 19:00

yuslepukhin added the release:1.24.2 label Feb 13, 2026

tianleiwu mentioned this pull request Feb 13, 2026

ORT 1.24.2 release cherry-pick round 2 #27343

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add boundary checks and add tests for SparseTensorProtoToDenseTensorProto#27323

Add boundary checks and add tests for SparseTensorProtoToDenseTensorProto#27323
yuslepukhin merged 6 commits intomainfrom
yuslepukhin/sparse_proto_conversion

yuslepukhin commented Feb 12, 2026

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

skottmckay left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

yuslepukhin commented Feb 12, 2026

Motivation and Context

Description

Motivation and Context

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

skottmckay left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants